Automatic Construction of a TMF Terminological Database using a Transducer Cascade
نویسندگان
چکیده
The automatic development of terminological databases, especially in a standardized format, has a crucial aspect for multiple applications related to technical and scientific knowledge that requires semantic and terminological descriptions covering multiple domains. In this context, we have, in this paper, two challenges: the first is the automatic extraction of terms in order to build a terminological database, and the second challenge is their normalization into a standardized format. To deal with these challenges, we propose an approach based on a cascade of transducers performed using CasSys tool of the Unitex linguistic platform that benefits from both: the success of the rule-based approach for the extraction of terms, and the performance of the TMF standard for the representation of terms. We have tested and evaluated our approach on an Arabic scientific and technical corpus for the Elevator domain and the results are very encouraging.
منابع مشابه
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملAn abstract model for the representation of multilingual terminological data: TMF – Terminological Markup Framework
Nous présentons un modèle abstrait de représentation de terminologies multilingues informatisées en XML défini dans le cadre du comité technique 37 de l'ISO. Il repose sur une méthodologie qui distingue d'une part la structure générale d'une base terminologique, et d'autre part les informations (catégories de donnée) qui servent à décrire les différents niveaux de cette structure. Summary. We a...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملLexTerm Manager: Design for an Integrated Lexicography and Terminology System
We present a design for a multi-modal database system for lexical information that can be accessed in either lexicographical or terminological views. The use of a single merged data model makes it easy to transfer common information between termbases and dictionaries, thus facilitating information sharing and re-use. Our combined model is based on the LMF and TMF metamodels for lexicographical ...
متن کاملFuzzy PD Cascade Controller Design for Ball and Beam System Based on an Improved ARO Technique
The ball and beam system is one of the most popular laboratory setups for control education. In this paper, we design a fuzzy PD cascade controller for a ball and beam system using Asexual Reproduction Optimization (ARO) technique. The ball & beam system consists of a servo motor, a grooved beam, and a rolling ball. This system utilizes a servo motor to control ball’s position on the beam. Chan...
متن کامل